AD-A104  927 
UNCLASSIFIED 


WISCONSIN  UNIV-MADISON  DEPT  OF  COMPUTER  SCIENCES  F/G  9/2 

PERFORMANCE  ANALYSIS  OF  ALTERNATIVE  DATABASE  MACHINE  ARCHITECTU— ETC(U) 
MAR  0U  P  B  HAWTHORN*  D  J  DEWITT  DAAG29-79-C-0ia2 

CSTR-3U4  NL 


AD  A 1  O  4  9  2 


COMPUTER  SCIENCES 
DEPARTMENT 

University  of  Wisconsin  - 
Madison 


\f  v- - 

i  "l-t 

/  -  . 


..  '.us  boon  approved. 

L .vase  and  scJe:  its  ■ 

•i  ■  animated.  S 


\6j  ^PERFORMANCE  ANALYSIS  OF  ALTERNATIVE 
\‘  DATABASE  MACHINE  ARCHITECTURES, 


)  -  Paula  B. /Hawthorn 


\  David 

Computer  Scienc^Technical 

a ^ - 


wrt  #383 


Performance  Analysis 
o  f 

Alternative  Database  Machine  Architectures 


Paula  B.  Hawthorn* 
David  J.  DeWitt+ 


*Present  Address:  Britton  Lea  Inc.,  900  Santa  Fe ,  Albany,  CA. 

Present  Address:  Computer  Science  Department,  Univ.  of  Wiscon¬ 
sin. 

This  research  was  partially  supported  /by  the  National  Science 
Foundet  ion  under  q  rants  MCS75-33339*  and  MCS7B-31721'1'  and  the 
United  States  Army  Research  Office  under  contracts  #DAAG29-79-C- 
0132",  jiDAAG29-79-C-0155+,  and  SDAAG29-75-C-0024  * . 


The  rapid  advances  in  the  development  of  low-cost  computer 
hardware  have  led  to  many  proposals  for  the  use  of  this  hardware 
to  improve  the  performance  of  database  management  systems.  Usu¬ 
ally  the  design  proposals  are  quite  vague  about  the  performance 
of  the  system  with  respect  to  a  given  data  management  applica¬ 
tion.  In  this  paper  we  predict  the  performance  of  several  of  the 
proposed  database  management  machines  with  respect  to  several 
representative  INGRES  queries.  The  systems  analyzed  in  this 
paper  include  are  associative  disks,  RAP,  CASSM,  DBC ,  DIRECT,  and 
CAFS.  We  demonstrate  that  no  one  database  machine  is  best  for 
executing  all  types  of  queries.  We  will  also  show  that  for  one 
class  of  queries  the  degree  of  performance  improvement  achieved 
does  not  warrant  use  of  a  database  machine. 


I 

I 


cion.  *  19  tracks  per  cylinder).  Table  2.1  summarizes  these  parame- 


meaory  interference.  DCTL,  the  number  of  512-byte 


Cell  processor  performance  By  storing  the  offsets  of  the  attributes  to  be  tested  in  cell- 


reduced  through  the  following  techniques: 


resident,  NDCELL,  Is  assumed  Co  be  19.  Table  2.3  Cell  parameters  and  values 


tlon  time' 


Host  overhead  These  two  values  shall  be  assigned  to  OVCPU  as  best  case  —  worst 


depends  on  the  functionality  of  the  validity  checking:  the 


Each  of  the  machines  relies  on  the  host  to  format  the  ject  operators.  For  example,  several  designs  depend  on  repeatedly 


chines  which  we  shall  examine.  are  calculated  for  each  query  on  each  machine. 


response  ciae  is  the  sub  of  Che  component  tlaes  Chet  cannot  be  3.1.1.  fast  INGRES 
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the  validation  of  the  query.  However 


encoded  in  one  1  byte,  then  since  the  name  is  stored  for  each 


required  for  decoding  can  be  disregarded.  The  value  for  n  for  QTRCOURSE  from  the  disk 


CSCAN  is  the  CCD  page  scan  time*  DPIO  is  the  time  to  read  Since  only  3  tuples  satisfy  this  query,  at  most  3  processors  will 


a  WORST  CASE  TIME  occurs  because  INGRES  maintains  QTRCOURSE  as  a  hashed  relation, 

Q  BEST  CASE  TIME  and  only  reads  three  pages  of  it.  The  caching  systems  (i.e,  RAP 
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building,  room  number,  capacity,  day,  and  hour  of  the 


Che  I/O  cine  Co  read  Che  ROOMS  (RIO)  and  COURSE  (CIO)  relaci. 


DROT/2  •  the  latency  required  to  access  tha  second  track.  Tha  retrieve  froa  the  associativa  disk  the  tuples  In  “COURSE*  with 


subquer las  M.5S  seconds)  dominates  both  the  baat  and  worst  casa  pass  Is  aada  over  the  second  relation  (COURSE)  and  a  second  bit 


the  two  relations  and  then  aerqing  the  two  sorted  lists  to  natch 


to  aack  bit  nap  cell  processor  will  keep  these  22  tuples  in  an  internal  buffer 

to  check  COURSE’S  bit  nap  and  nark  tuple 

to  return  4  attributes  for  the  duration  of  the  query.  Then  each  cell  processor  will 


couplets  the  join.  Six  cell  processors  will  join  their  copy  of  6*CSfAVAC  ♦  2856-DREAD.  Thus: 

tne  22  tuples  l (oak  the  ROOMS  relation  with  eleven  16K  pages  of  CIO  -  lDAVAC+5*CKAVAC+2m*DREAD  —  DAVAC+6*CyAVAC+28S8*DREAD) 


the  COURSE  relation  Is  Initially  loaded,  then  CIO  ■  OAVAC  ♦  1)  fora  sorted,  restricted  projection  of  'ROOMS*,  holding 


1, 


RPROC,  tbo  tints  to  scon  tbo  ROOMS  relation  and  nark  tho 
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Using  the  algorithms  described  above  and  similar  technique 


Two  query  types  previously  defined  are  data- i ntensivt  and  designs  showed  a  significant  performance  improvement  ov*r  the 


ir.gle  relation.  Por  this  case  the  associative  disk  and  D3C 
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[ DEWI 79b}  DeWitt,  D.J.,  "Query  Execution  in  DIRECT",  Proceedin9S 
of  the  ACM-SIGHOD  1979  International  Conference  of  Manage¬ 
ment  of  Data,  May  1979,  pp  13-22. 
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Channel 


puter  are  the  records  with  the  required  values.  As  originally 

designed  (SLOT70],  the  cell  processors  performed  no  arithmetic  Cell  Procesioi 


separate  controlling  processor.  On  the  fixed  head  disk,  each 


records.  Each  record  is  defined  by  delimiter  fields  and  a  record 


the  data  to  the  host  procassor 


transput  tha  data  item  it  has  before  reading  the  next  used  in  this  discussion  is  that  reported  In  (SCHU78] 


controller,  which  would  determine  which  cells  contain  the 


page  into  Its  local  buffer  and  scans  the  page  for  tuples  which 


